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Abstract. In this paper we comment on the results concerning the performances of matched filters, scale adaptive 
filters and Mexican hat wavelet that recently appeared in literature in the context of point source detection in 
Cosmic Microwave Background maps. In particular, we show that, contrary to what has been claimed, the use 
of the matched filters still appear to be the most reliable and efficient method to disantangle point sources from 
the backgrounds, even when using detection criterion that, differently from the classic na thresholding rule, takes 
into account not only the height of the peaks in the signal corresponding to the candidate sources but also their 
curvature. 
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1. Introduction 

Studying diffuse backgrounds in all-sky maps implies the 
possibility of disentangling background signals from those 
originated from point sources. This task is of funda- 
mental importance in dealing with Cosmic Microwave 
Background (CMB) data. In this context various papers 
studied the "optimal" method for such a task. Three main 
methods have been considered so far: the Mexican hat 
wavelet (Cayon et al., 2000), the scale-adaptive filters (or 
optimal pseudo-filters) and the matched filters (Sanz et 
al., 2001, Vio et al., 2002 and reference therein). Matched 
filter (MF) is constructed taking into account the source 
profile and the background to get the maximum signal-to- 
noise ratio (SNR) at the source position. Scale-adaptive 
filter (SAF) is built similarly to MF with the additional 
constraint to have a maximum in filtered space at the scale 
and source position. The Mexican hat wavelet (MHW) 
represents a separate case since it is "a priori" filter, 
adapted to the detection of point sources. Its main limi- 
tation is that it is founded on semi-empirical arguments 
and therefore lacks a rigorous theoretical justification. For 
this reason, in the following we will be especially concerned 
with MF and SAF. 
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Vio et al. (2002) (henceforth VTW) have shown that, 
in spite the claims of "optimality" for SAF and MHW 
(Sanz et al., 2001, henceforth SHM), in reality these fil- 
ters do not behave as good as the MF. In a recent work 
in the context of one-dimensional signals, Barreiro et al. 
(2003, henceforth BSHM) compare SAF, MHW, and MF 
on the basis of a detection criterion based on the Neyman- 
Pearson decision rule, that takes into account not only the 
height of signal peaks but also their curvature. These au- 
thors find that, although MF is effectively optimal in most 
of the cases, there are situations where SAF and MHW 
can overperform it. Here we show that such a result is not 
correct since it is linked to the measure of performance 
adopted by authors, that tends to favour the filters char- 
acterized by a low detection capability. MF is in general 
superior to these other two filters. 

2. Problem Formalization 

For sake of generality, we firstly present our arguments 
in R n and then we specialize the results to the one- 
dimensional case. 

The sources are assumed to be point-like signals con- 
volved with the beam of the measuring instrument and 
are thus assumed to have a profile t(x). The signal y(x), 
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x G R n , is modeled as 

y( x ) = ^2 s o( x )+ z ( x ) 



where 



Sj (x) = Aj t(x -Xj), 



(1) 



(2) 



Aj and Xj are, respectively, unknown source amplitudes 
and locations, and z(x) is a zero-mean background with 
power-spectrum P(q) 



E[z(q)z*(q')]=P(q) 5 n (q-q'). 



(•3) 



Henceforth E[-] and " * " will denote the expectation and 
complex conjugate operators, respectively, S n (q — q') the 
n-dimensional Dirac distribution, and z{q) the Fourier 
transform of z{x) 



z(q) = f 

J —l 



z{x) c- tq x dx. 



(4) 



To properly remove the point sources from the signal it is 
necessary to estimate the locations {xj} and amplitudes 
(fluxes) {Aj} of the sources. 

The classic procedure for the detection of the sources 
consists in filtering signal to enhance the sources with re- 
spect to the background. This is done by cross-correlating 
the signal y(x) with a filter The source locations are 
then determined by selecting the peaks in the filtered sig- 
nal that are above a chosen threshold. Finally, the source 
amplitudes are estimated as the values of the filtered signal 
at the estimated locations. The question is the selection 
of an optimal filter $ for such procedure. In order to de- 
fine it, some assumptions are necessary. In particular it is 
assumed that the source profile and background spectrum 
are known, the profile is spherically symmetric, character- 
ized by a scale R Sl and the background is isotropic. These 
assumptions allow to write s(x) = s(x), where x = \\x\\, 
and P{q) = P{q) for q = \\q\\. In addition, source overlap 
is assumed negligible. In the present context, we are inter- 
ested in the general family of spherically symmetric filters 
$(x;b) of the form $(x; b) = 4>{\\x - b\\). The cross- 
correlation between Q(x; b) and y(x) provides a filtered 
field w(b; <fi) with mean /i(6; <p) and variance a 2 : (</>). 

2.1. Matched filters (MF) 

Source locations are assumed to be known and the aim is 
to estimate the amplitudes. Given the assumed distance 
between the sources, it is enough to consider a field y(x) 
as in 0} with a single source at the origin, s(x) = At(x). 
Its amplitude w(0;<p) is estimated by requiring it to be an 
unbiased estimator of A, i.e., /i(O;0) = A. On the other 
hand, to enhance the magnitude of the source relative to 
the background the filter $ is required to minimize the 
variance cr 2 (4>). This has the effect of maximizing, among 
unbiased estimators, the detection level 



v(<p) 



MO; 



a{4>) 



(5) 



which measures the capability of the filter to detect cor- 
rectly a source at the prescribed location. 

Since $ is chosen in a way that w(0; (p) is a minimum 
variance linear (in y(x)) unbiased estimator of A, it follows 
that (Gauss-Markov theorem) w(0; </>) is the (generalized) 
least squares estimate of A achieved by the filter 



m = TaWr 

with minimum variance 



<1 



l ^dq, 



<? 2 (4>) = T , 
oa 



(6) 



(7) 



where S = 2 W 2 T~ 1 (n/2). In other words, filter ©, 
calle d matched filter, optimizes the signa l-to-noise ratio 
le.g.. lKozma fc Kellev lilflfij IPra,tt 111 PQlh . Although this 
filter is commonly used for signal detection, it is not the 
only approach to define an " optimal filter" . A possible 
alternative is represented by the Wiener-filters that, how- 
ever, are designed to m inimize the prediction e rror given 
covariance information l|Rabiner fc Gold1ll975j) . 

2.2. Scale adaptive filters (SAF) 

In the pseudo-filter approach of SHM the filters have the 
same form of $(cc; b) with an additional scale dependence 



*{x;R,b) = -^^ 



R ' (8) 

for some spherically symmetric function %p. The cross- 
correlation between this filter at scale R and y(x) provide 
a filtered field w(b, R; ip). 

To determine an optimal filter ip, SHM minimize the 
variance of the filtered field with the two constraints: 
w(0, Ro;ip) is required to be, as in the previous section, 
an unbiased estimator of A for some known Rq w R s , and 
ip is selected so that /i(0, R; -0) has a local maximum at 
scale i?o- This latter translates into 

/> + OC / 7 1 \ 

J o ? n -V(g) ij{Roq) [n + dq = 0. (9) 

Minimizing a 2 (Rq; ip) with the two constraints yields the 
filter (SHM) 



iP(R q) 
where A 



S A P(q) 
ac — b 2 , 

b = 



nb + c — (na + b) 



dlnr(g) 



d\nq 



(10) 



dr 



1 n 



P dlnq' 
1 / dr 



q 



(ii) 



dq. 



lo P \d\nq 

and a is as in @. This filter provides a field of variance 

n 2 a + 2nb + c 



SA 



(12) 



and an estimator of the amplitude A that is again linear 
and unbiased. 
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3. Filter comparison 



the calculation of the quantities 



In their work VTW stress the fact that, since both <!> and 
^ provide a linear and unbiased estimate of the amplitude 
A then, regardless the source profile and background spec- 
trum and because of the optimality of the least squares, 
a 2 (Ro;tp) > cr 2 (0). As a consequence the value of the de- 
tection level, V, corresponding to $ is at least as high, or 
higher, than that achieved with Furthermore, via an ex- 
tensive set of numerical simulation VTW have shown that 
this conclusion holds even when the source location uncer- 
tainty is taken into account. In other words, enough infor- 
mation about the scale of the source is already included in 
the derivation of the matched filter. Via numerical simu- 
lations VTW have also shown that MF overperforms SAF 
when comparing the resulting numbers of incorrectly de- 
tected sources. VTWs conclusion is then nothing is gained 
by using SAF. 

Recently, in the context of one-dimensional signals, 
zero-mean Gaussian background with scale-free power 
spectrum P(q) = Dq^ 1 , and Gaussian profile t(x) = 
Ae~ x / 2Rq for the source, BSHM criticized these conclu- 
sions through the argument that the detection level T> and 
the nu thresholding method used by VTW as detection 
rule are not sufficient to support their results . For this 
reason, they introduce a new detection criterion based on 
a Neyman-Pearson decision rule which uses not only the 
heigth of the maxima in the signal but also their curva- 
ture. This method can be summarized as follows (for more 
details, see BSHM) 

3.1. First case: fixed source amplitudes 

If the ID background z(x) is Gaussian, then it is possi- 
ble to estimate the expected total number density rib of 
maxima (i.e., number of maxima per unit interval in x) as 
well their expected number density nb(v,n) per intervals 
(v, v+dv) and (n, K+dn), where v = z/ o~ and n = — z"/cr 2 
are the normalized field and curvature, respectively. Here, 
c 2 is the moment of order 2n associated with the field. If 
all the sources are assumed to have the same amplitude 
A, it is possible to estimate the corresponding quantities n 
and n(v, n\v s ), v s — A/<jq, when the sources are embedded 
in the background. These quantities allow to calculate, for 
any region i?»(z^, k), the probability density functions 



p b (u, k\0) 



rib(v, k) 
rib 



p(v, k\v s 



n(y, k) 



(13) 



that can be interpreted as the probability that a given 
maximum is due to the background or to a local source, 
respectively. In their turn, pb(v, k\0) and p(y, n\v s ) allow 



1 Here it is necessary to stress that, contrary to what written 
by BSHM, the superiority of MF with respect to SAF is claimed 
not only on the basis of the larger value of T> and of the number 
of correct detections, but also on the better detection capability 
for a given average number of incorrect detections. 



Pbiy, n\§)dvdn 



R, 



1-/3 



p(v, n\v s )dvdK,, 



(14) 



(15) 



R. 



that provide the so called false alarm probability (i.e., the 
probability of interpreting noise as signal) and the power 
of the detection (i.e., (3 represents the probability of inter- 
preting signal as noise) . i?» is called the acceptance region. 

In order to obtain a detection criterion, BSHM intro- 
duce the significance s 2 



mal 



tmal 



^signal ~^ ^no- signal 



(16) 



where in the numerator appears the difference between the 
mean number of peaks in N different realizations of the 
background, in presence and absence of signal: (1 — (3)N 
and aN, respectively. cr 2 ignal and cr 2 „ signal represent the 
variances of corresponding quantity (N) . It is not difficult 
to show that 



! (a, (3) oc 



(1 -13-af 



0(1 - 0) + a(l - a) 



(17) 



The idea of BSHM is to maximize s 2 with respect to a 
with the constraint that R* is defined by the the Neyman- 
Pearson decision rule. The reason is that the acceptance 
region 

L M »s) = P -^^>L. (18) 

provides the highest power 1 — f3 for a given confidence 
level a. is a constant: if L > a source is present, 
whereas if L < L* no source is present. Eqs. (CHJl and ljT%)l 
allow to exchange the maximization of s 2 with respect to 
a with the maximization with respect to L*. 

It happens that for SAF, MF, and MHW, and inde- 
pendently from the index 7, s 2 is maximized for L, «1. 
Fig. ^ shows the corresponding for sources with an 
amplitude A such as v s — 3 after filtering with SAF. This 
figure shows that, at variance with SAF and MHW, the 
acceptance region of MF does not depend on the curva- 
ture « but only on the height of the maxima. Therefore, 
for MF the detection rule proposed by BSHM provides a 
criterion similar to the classic no~ thresholding rule. 

Once fixed i?» it is possible to calculate the expected 
number density of incorrect and the expected number 
density n* of correct detections by integrating nl(is, k) and 
n(v, nl^s) over i?*. These quantities are used by BSHM to 
calculate the ratio r = n*/rib, called reliability, and the 
quantity 

D = n ~ rMF x 100, (19) 

fMF 

where subindex i refers to the different filters, that these 
authors use as a measure of the performances of MF, SAF, 
and MHW. Fig.Elshows nl,n* ,r and D, as function of the 
index 7. Essentially on the basis of quantity r BSHM claim 



4 



R. Vio, P. Andreani, & W. Wamsteker: Optimal Detection of Sources 



SAF 
MF 
MHW 

0.5 



Fig. 1. Acceptance region i?*, when 7 = 1.4, for SAF, MF, 
and MHW for sources with aplitude A such that v s = 3 
after filtering with SAF. The maxima with v and n above 
the corresponding line are accepted as sources and those 
below are rejected. 



that for 7 < 1 and 7 > 1.6 MF overperforms SAF, whereas 
the contrary holds for 1 < 7 < 1.6. These conclusions 
deserve some comments. 

First, similarly to VTW and in spite of the introduc- 
tion of the new detection criterion, BSHM find that in 
most situations the use of the second constraint © in 
SAF is not only useless but even harmful. Second, if, on 
the one hand, the superiority of MF for 7 < 1 and 7 > 1.6 
is out of discussion (this filter provides the largest number 
of correct detections and the smallest number of incorrect 
ones), the same conclusion for SAF when 1 < 7 < 1.6 is 
questionable. In this range of 7 SAF provides a smaller 
number of incorrect detections, but at the same time also 
a smaller number of correct ones. In this respect, at least 
in principle, the reliability parameter r should be used as a 
measure of the filter performances only when an incorrect 
detection has a larger "cost" than missing a source, a fact 
that has to be proved in the context of CMB. Furthermore, 
even in the case of an high "cost" for the incorrect detec- 
tions, r has to be used with great care. The reason is that 
MF is constructed in such a way to maximize source de- 
tections. Therefore, the maximization of s 2 with respect 
to I/* provides a criterion favouring the detection of a true 
source rather than the rejection of a false one. 

If one is worried of incorrect detections, there is a sim- 
ple cure: the choice of a L* making the detection of the 
sources less efficient. In this way, part of the correct de- 
tections will be lost but also the number of incorrect de- 
tections will decrease. Furthermore, in case of sources em- 
bedded in the background the signal peaks are expected 
to have a mean height larger than that expected in case of 
only background signal. Therefore, the smaller detection 
efficiency will affect more the number of incorrect detec- 
tions than that of the correct ones. This fact is shown in 



Fig.|21where it is evident that, when 7 = 1.4 and L* « 1.2, 
MF has the same number density of incorrect detections 
as SAF with L* = 1 but still a larger number density of 
correct detections and consequently a larger reliability r. 
The conclusion is that, as done in VTW, a meaningful 
evaluation of the performances of the two filters requires 
that the comparison is made by fixing the number density 
of incorrect (or alternatively, correct) detections. If n£ is 
set at the value of SAF for L* = 1, the quantity r shown 
in Fig. 01 indicates that MF is better than SAF also for 
1 < 7 < 1-6. 

Similar arguments hold also for MHW that BSHM 
claim to provide a slightly better performance than MF 
when 7 > 2. Fig. shows again that this conclusion is not 
correct. 

3.2. Second case: random source amplitudes 

The arguments presented in the previous section have 
been developed under the hypothesis that all the sources 
are characterized by the same amplitude A. Of course, 
this condition is not satisfied in real situations. In order 
to solve this problem, BSHM suggest to substitute the 
likelihood ratio H18|l with: 



L{v, k) 



p(v, k) 



where 



p(i/,K|0)' 
p(v,k) = I p(v,n\v s )p{v s )dv s , 



(20) 
(21) 



and p(v s ) is the probability density function of the nor- 
malized amplitudes. After that, the method of estimat- 
ing the quantities n 



b> 



r, and D proceedes in the 
same way as in the previous section with the only excep- 
tion that n{v, k\v s ) has to be substituted with n(y,K) = 
J n(y, K\v s )p{v s )dv s . By assuming that p(v s ) is an uniform 
probability distribution function with values in the range 
[0, i/ c ], BSHM arrive to results similar to those they ob- 
tained for the case with fixed source amplitude. Again, 
this deserves some comments. 

The first, and most obvious, is that such a conclusion 
suffers the same limitation found in the previous section. 
Consequently the claim of superiority of SAF and MHW 
over MF is again not founded. The second comment is 
that, in order to obtain reliable results, p{v s ) is needed to 
be known with good accuracy: the use of a wrong p(v s ) 
will end in a false rule according to which p{v s ) overweighs 
the smallest amplitudes or the largest ones, favouring the 
(correct and incorrect) detections or the (correct and in- 
correct) rejections with obvious consequences on the "op- 
timality" of the method. In the framework of CMB studies 
the a priori information on p{y s ) is not available or is very 
inaccurate. The consequence is that a simple detection 
rule as, for example, the 3cr thresholding criterion could 
still represent the best choice since it requires the only a 
priori knowledge of the noise level. This approach is much 
simpler and safer than estimating the distribution of the 
source amplitudes. 
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Fig. 2. Relationship 7 vs. the number density nl of incor- 
rect detections, the number density n* of correct detec- 
tions, the reliability r, and the relative detection ratio D 
corresponding to SAF, MF, and MHW for L* = 1. 
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Fig. 3. Relationship L* vs. the number density nl of in- 
correct detections, the number density n* of correct de- 
tections, the reliability r, and the relative detection ratio 
D corresponding to MF for 7 = 1.4. For comparison, the 
corresponding levels of SAF for L* = 1 are plotted too. 



4. Summary and Conclusions 

This paper deals with the detection techniques to extract 
point-sources from Cosmic Microwave Background maps. 
Various recent works appeared in the literature, present- 
ing new techniques with the aim to improve the perfor- 
mances of the classical matched filters (MF). In particu- 
lar the scale adaptive filters (SAF) and the Mexican hat 
wavelet (MHW) have be en proposed as th e most efficient 
and reliable methods fsee lSanz et al. 2001, and references 
therei n). This claim was subject to criticism bvlvio" et al. 
l|2002j) since they showed that in reality SAF and MHW 




Fig. 4. Relationship between 7 vs. the number density n* 
of correct detections and the reliability r corresponding to 
SAF and MF when for both filters the number density of 
incorrect detections is fixed to the value nl of SAF with 
L* = 1. 
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Fig. 5. Relationship 7 vs. the number density n* of correct 
detections and the reliability r corresponding to MHW and 
MF when for both filters the number density of incorrect 
detections is fixed to the value n'l of MHW with L* — 1. 



have performances that in general are inferior to those 
provided by MF. 

Recently iBarreiro et al. I l|2003j) used the argument 
that a criterion making use of a simple na thresholding 
rule is not fu lly sufficient to claim detection. To support 
this assertion lBarreiro et al. l |2003'). in the context of one- 
dimensional signals and sources with Gaussian profiles, 
adopt a detection criterion based on a Neyman-Pearson 
decision rule that makes use of both the height and the 
curvature of the maxima in the signal. Their theoretical 
arguments and numerical simulations indicate that, al- 
though in general MF still remains the filter with the best 
performances, there are situations where SAF and MHW 
overperform it. In this paper we show that this conclusion 
is again not correct since it is basically founded on a per- 
formance test favouring the filters characterized by a low 
detection capability. This means that there is no reason 
to prefer SAF or MHW to MF. Furthermore, the claimed 
superiority of SAF and MHW, when the source scale has 
to be estimated from the data, has still to be proved, and 
in principle also MF could be modified in such a way to 
efficiently deal with this situation. 

These conclusions are not academic: the use of non- 
standard statistical tools is indicated only in situations of 
real and sensible improvements of the results. New tech- 
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niques that do not fulfill this requirement should be in- 
troduced with care: they prevent the comparison with the 
results obtained in other works and may lead people to use 
not well tested methodologies (MF has been successfully 
used for many years in very different scientific contextes) 
ending up in not reliable results. Moreover, in the present 
context, the use of SAF introduces further complications 
in the analytical form of the filters (e.g., compare Eq. © 
with Eq. and in the definition of the detection rule 

(for MF the calculation of the curvature n of the peaks in 
the signal is not required). 
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